Computer-aided Detection: The Impact of Machine Learning Classifier and Image Feature Selection on Scheme Performance

نویسندگان

  • Xingwei Wang
  • Dror Lederman
  • Jun Tan
  • Bin Zheng
چکیده

Computer-aided detection and diagnosis (CAD) schemes have been developed and applied to detect suspicious lesions depicted on biomedical images. After identifying initial candidates for the targeted suspicious lesions, most CAD schemes use a pre-trained multi-image-feature based machine learning classifier to classify these candidates into two groups of positive and negative detections. Although a large number of image features and machine learning classifiers have been developed and tested using different image databases, selecting the optimal image features and a machine learning classifier remains a challenged issue in CAD development. In this study, we assembled two independent image datasets for training and testing. We optimized four machine learning classifiers (namely, artificial neural network, support vector machine, Bayesian belief network, and k-nearest neighbor algorithm), which were trained and tested using the same dataset with two sets of image features. The results showed that using the first feature set, the case-based classification performance of four classifiers measured with the normalized areas under FROC-type performance curves (AUCs) ranged from 0.925 to 0.943 without statistically significant difference (p > 0.05). When using the second image feature set, AUC values of four classifies significantly reduced to the range from 0.886 to 0.903 (p < 0.01). This study suggested that although these four classifiers were built based on different machine learning concepts, their actual performance levels were likely to converge to the similar level when using the same image features and an independent testing dataset. Thus, selecting image features rather than a machine learning classifier plays a more important role in determining CAD performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Anomaly Detection Using SVM as Classifier and Decision Tree for Optimizing Feature Vectors

Abstract- With the advancement and development of computer network technologies, the way for intruders has become smoother; therefore, to detect threats and attacks, the importance of intrusion detection systems (IDS) as one of the key elements of security is increasing. One of the challenges of intrusion detection systems is managing of the large amount of network traffic features. Removing un...

متن کامل

Improving Accuracy in Intrusion Detection Systems Using Classifier Ensemble and Clustering

Recently by developing the technology, the number of network-based servicesis increasing, and sensitive information of users is shared through the Internet.Accordingly, large-scale malicious attacks on computer networks could causesevere disruption to network services so cybersecurity turns to a major concern fornetworks. An intrusion detection system (IDS) could be cons...

متن کامل

A Random Forest Classifier based on Genetic Algorithm for Cardiovascular Diseases Diagnosis (RESEARCH NOTE)

Machine learning-based classification techniques provide support for the decision making process in the field of healthcare, especially in disease diagnosis, prognosis and screening. Healthcare datasets are voluminous in nature and their high dimensionality problem comprises in terms of slower learning rate and higher computational cost. Feature selection is expected to deal with the high dimen...

متن کامل

Intrusion Detection based on a Novel Hybrid Learning Approach

Information security and Intrusion Detection System (IDS) plays a critical role in the Internet. IDS is an essential tool for detecting different kinds of attacks in a network and maintaining data integrity, confidentiality and system availability against possible threats. In this paper, a hybrid approach towards achieving high performance is proposed. In fact, the important goal of this paper ...

متن کامل

A Novel Architecture for Detecting Phishing Webpages using Cost-based Feature Selection

Phishing is one of the luring techniques used to exploit personal information. A phishing webpage detection system (PWDS) extracts features to determine whether it is a phishing webpage or not. Selecting appropriate features improves the performance of PWDS. Performance criteria are detection accuracy and system response time. The major time consumed by PWDS arises from feature extraction that ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJIIP

دوره 1  شماره 

صفحات  -

تاریخ انتشار 2010